rank | frequency | n-gram |
---|---|---|
1 | 21375 | -ا |
2 | 20345 | -، |
3 | 18678 | -ة |
4 | 14772 | -ن |
5 | 11142 | -ت |
rank | frequency | n-gram |
---|---|---|
1 | 7690 | -ها |
2 | 6234 | -ات |
3 | 5695 | -ين |
4 | 5655 | -ية |
5 | 3904 | -هم |
rank | frequency | n-gram |
---|---|---|
1 | 1981 | -تها |
2 | 1392 | -ية، |
3 | 1086 | -يين |
4 | 1055 | -تهم |
5 | 992 | -ها، |
rank | frequency | n-gram |
---|---|---|
1 | 602 | -اتها |
2 | 388 | -اتهم |
3 | 316 | -انية |
4 | 301 | -ارات |
5 | 236 | -يين، |
rank | frequency | n-gram |
---|---|---|
1 | 95 | -انيين |
2 | 87 | -ياتها |
3 | 87 | -انية، |
4 | 76 | -راتها |
5 | 67 | -اتها، |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings